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A dynamical theory for the evolution of the genetic code is presented, which accounts for its 
universality and optimality. The central concept is that a variety of collective, but non-Darwinian, 
mechanisms likely to be present in early communal life generically lead to refinement and selection of 
innovation-sharing protocols, such as the genetic code. Our proposal is illustrated using a simplified 
computer model, and placed within the context of a sequence of transitions that early life may have 
made, prior to the emergence of vertical descent. 



The genetic code could well be optimized to a greater 
extent than anything else in biology, yet is generally re- 
garded as the biological element least capable of evolving. 

There would seem to be four reasons for this para- 
doxical situation, all of which reflect the reductionist 
molecular perspective that so shaped biological thought 
throughout the 20th century. First, the basic explanation 
of gene expression appears to lie in its evolution, and not 
primarily in the specific structural or stereochemical con- 
siderations that are sufficient to account for gene replica- 
tion. Second, the problem's motto, "genetic code", is a 
misnomer that makes the codon table the defining issue 
of gene expression. A satisfactory level of understanding 
of the gene should provide a unifying account of replica- 
tion and expression as two sides of the same coin. The ge- 
netic code is merely the linkage between these two facets. 
Thus, and thirdly, the assumption that the code and the 
decoding mechanism are separate problems, individually 
solvable, is a reductionist fallacy that serves to deny the 
fundamental biological nature of the problem. Finally, 
the evolutionary dynamic that gave rise to translation is 
undoubtedly non-Darwinian, to most an unthinkable no- 
tion that we now need to entertain seriously. These four 
considerations structure the approach we take in this pa- 
per. 

To this point in time, biologists have seen the univer- 
sality of the code as either a manifestation of the Doctrine 
of Common Descent or simply as a "frozen accident" . 
Viewing universality as following from common descent 
renders unthinkable the notion explored here that a uni- 
versal code may be a necessary pre-condition for common 
ancestry — indeed even for life as we know it. We will ar- 
gue in this paper — a maturation of the earlier concept of 
the progenoteQ — that the very fact of the code's evolv- 
ability, together with the details of its internal structure, 
provide strong clues to the nature of early life, and in 
particular its essential communal character Q. 

Beyond the code's universality we have very few clues 



to guide us in trying to understand its evolution and that 
of the underlying decoding mechanism. The principal 
ones again are properties of the code itself; specifically, 
the obvious structure of the codon table. The table pos- 
sesses (at least) two types of order: synonym order and 
relatedness order. The first is the relatedness of codons 
assigned to the same amino acid; the second the related- 
ness of codons assigned to related amino acids. Related- 
ness among the amino acids is context dependent, and in 
the context of the codon table could a priori reflect al- 
most anything about the amino acids: their various prop- 
erties, either individually or in combination; the several 
macromolecular contexts in which they are found, such 
as protein structure, the translation mechanism, the evo- 
lution of translation; or the pre-translational context of 
the so-called RNA-world. Although we don't know what 
defines amino acid "similarity" in the case of the code, we 
do know one particular amino acid measure that seems to 
express it quite remarkably in the coding^ontext. That 
measure is amino acid polar requirement^, 0, Q . While 
the relatedness order of the code is marginally evident 
from simple inspection of the codon table 3, 4,0, SB) it 
is pronounced when the amino acids are represented by 
their respective polar requirements 

A major advance was provided by computer simulation 
studiesH [Hill El [Hill of the relatedness ordering of 
the amino acids over the codon table, which showed that 
the code is indeed relationally ordered, and moreover is 
optimized to near the maximum extent possible. Com- 
pared to randomly-generated codes, the canonical code is 
"one in a million" when the relatedness measure is the po- 
lar requirement. No other amino acid measure is known 
to possess this characteristic (in our opinion, the sig- 
nificance of this observation has not been adequately rec- 
ognized or pursued). These precisely-defined relatedness 
constraints in the codon table were unexpected, and still 
cry out for explanation. 

As far as interpretation goes, the optimal aspect of the 



2 



genetic code is surely a reflection of the last aspect of the 
coding problem that needs to be brought into considera- 
tion: namely, the precision or biological specificity with 
which translation functions. Precision, along with every 
aspect of the genetic code, needs to be understood as 
part of an evolutionary process. We would contend that 
at early stages in cellular evolution ambiguous transla- 
tion was tolerated (there being no alternative), and was 
an important and essential part of the evolutionary dy- 
namic (see below). What we imply by ambiguity here 
is inherent in the concept of group codon assignments, 
where a group of related codons is assigned as a whole to 
a corresponding group of related amino acids Q. From 
this flows the concept of a "statistical protein" , wherein 
a given gene can be translated not into a unique protein, 
but instead into a family of related protein sequences. 
Note that we do not say that these are an approximation 
to a perfect translation of the gene, thereby implying 
that these sequences are in some sense erroneous. Early 
life did not require a refined level of tolerance, and so 
there was no need for a perfect translation. Ambiguity 
is therefore not the same thing as "error" . 

The phylogenetic expression of ambiguity is reticulate 
evolution. In reticulate evolution, there is no unique no- 
tion of genealogical descent: genetic content can be dis- 
tributed collectively. Accordingly, as we now turn the 
emphasis away from the documentation of the static fea- 
tures of the genetic code, and towards their evolutionary 
origins, we must necessarily invoke an evolutionary dy- 
namic distinct from that identified originally by Darwin. 
This dynamic can be seen as a kind of biological game 
in which both the players and the rules of play are unfa- 
miliar, at least in the non-microbial world. The players 
are cell-like entities still in early stages of their evolu- 
tions. The evolutionary dynamic (the "rules") involve 
communal descent. The key element in this dynamic 
is innovation-sharing, an evolutionary protocol whereby 
descent with variation from one "generation" to the next 
is not genealogically traceable, but is a descent of a cel- 
lular community as a whole. Even if an organismal an- 
cestry were to some extent traceable, it would have no 
significance, because it is the community as a unit, not 
the individual organismal lineages therein, that vary in 
descent. 

The purpose of this paper is to show that cvolvability, 
universality and optimality can all be understood natu- 
rally and comprehensively, but not within a framework 
of strictly vertical evolution. Specifically, we will herein 
model the evolution of translation, the codon table, the 
constraints therein, the universality of the code and the 
decoding mechanism, not as a sum of parts, but as a 
whole. This paper is the first in a series that attempts 
to present such a comprehensive model: subsequent ar- 
ticles will focus more on the decoding mechanism, but 
here we focus primarily on innovation-sharing. The cen- 
tral conjecture in our model is that innovation sharing, 
which involves horizontal transfer of genes and perhaps 
other complex elements among the evolving entities — a 



dynamic far more rampant and pervasive than our cur- 
rent perception of horizontal gene transfer — is required 
to bring the evolving translation apparatus, its code and 
by implication the cell itself to their current condition. 

Our point of view alleviates the need for any assump- 
tion of a unique common ancestor. We argue that the 
universality of the code is a generic consequence of early 
communal evolution mediated by horizontal gene trans- 
fer (HGT), and that HGT enhances optimality. Our ar- 
guments are backed up by computer simulation studies, 
which are necessary in order to probe the complex inter- 
actions between the variety of collective mechanisms that 
we shall present. We show that there are virtuous cycles 
of cooperativity: 1) the more similar the genetic codes 
are, the greater the intensity of horizontal gene trans- 
fer is, and the stronger is the tendency for codes to be- 
come more similar; 2) horizontal gene transfer helps the 
codes to optimize, and optimization enforces universal- 
ity and compatibility between translational machineries. 
These cooperative dynamics arise because of the dual role 
played by the genetic code: it is not only a protocol for 
encoding amino acid sequences in the genome but also an 
innovation-sharing protocol. Here we identify two syner- 
gistically interacting mechanisms for the emergence of a 
universal innovation-sharing protocol: dynamic compe- 
tition between protocols favoring the popular ones and 
effective attraction of codes due to exchange of protein 
coding regions. Other mechanisms are also possible, and 
will be discussed elsewhere. 

If Darwin had been a microbiologist, he surely would 
not have pictured a "struggle" for existence as "red in 
tooth and claw" . Our view of competition in a commu- 
nal world as a dynamical process is very different from the 
widely-understood notion of Darwinian evolution. "Sur- 
vival of the fittest" literally implies that there can only 
be one winner from the forces of selection, whereas in a 
communal world, the entire distributed community ben- 
efits and its structure becomes modified by the forces of 
a selection that is an inherently biocomplex phenomenon 
involving the dynamics between the community elements 
and the interaction with the environment. The most gen- 
eral sense in which we mean competition in this article is 
the complex dynamical rearrangement of the community 
structure. 

Our framework fits naturally the recently pro- 
posed picture that early evolution was dominated by 
horizontal gene transfer, as evidenced by detailed 
phvlo genetic 1 15j|. biochemical^] and structural^ anal- 
yses of the aminoacyl-tRNA synthetases. The broader 
implication of this scenario is that innovation sharing led 
to the emergence of modern cell designs |l8j from a com- 
munal state - not a unique, shared ancestor. Such a com- 
munal state existed prior to the point of emergence of 
vertical evolution, which has been termed the Darwinian 
transition|l8|. The defining property of the communal 
state was that it was capable of tolerating and utilizing 
ambiguity, as refiected in the pervasive role of horizon- 
tal gene transfer. A Darwinian transition corresponds 
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to a state of affairs when sufficient complexity has arisen 
that the state is incapable of tolerating ambiguity, and so 
there is a distinct change in the nature of the evolution- 
ary dynamics — to vertical descent. We envision that such 
Darwinian transitions occurred in each of the three ma- 
jor lineages. The present work does not address the Dar- 
winian transition itself, but explains how the communal 
state could have arisen in the first place: in our scenario, 
it is the inevitable by-product of the establishment of an 
innovation-sharing protocol — the genetic code — leading 
to the explosive growth of complexity. Thus, we may 
speculate that the emergence of life should best be viewed 
in three phases, distinguished by the nature of their evo- 
lutionary dynamics. In the first phase, treated in the 
present paper, life was very robust to ambiguity, but 
there was no fully-unified innovation-sharing protocol. 
The ambiguity in this stage led inexorably to a dynamic 
from which a universal, and optimized innovation-sharing 
protocol emerged, through a cooperative mechanism. In 
the second phase, the community rapidly developed com- 
plexity through the frictionless exchange of novelty en- 
abled by the genetic code — a dynamic we recognize to 
be patently Lamarckianj^. With the increasing level 
of complexity there arose necessarily a lower tolerance 
of ambiguity, leading finally to a transition to a state 
wherein communal dynamics had to be suppressed and 
refinement superseded innovation. This Darwinian tran- 
sition led to the third phase, which was dominated by 
vertical descent and characterized by the slow, and tem- 
pered accumulation of complexity. 



I. UNIVERSALITY AND HGT 



Previous arguments about universality rely on the exis- 
tence of a universal common ancestor with a frozen code. 
A detailed deconstruction of such arguments is presented 
in the Supplementary Material, and further supported by 
the computer simulations presented in Section (QlJ, but 
the unambiguous conclusion is that vertical descent on 
its own is insufficient to explain the universality of the 
genetic code. Here we present an alternative: the univer- 
sality of the genetic code is a generic consequence of the 
communal evolution of early life. HGT of protein cod- 
ing regions and HGT of translational components ensures 
the emergence of clusters of similar codes and compati- 
ble translational machineries. Different clusters compete 
for niches, and due to the benefits of the communal evo- 
lution, the only stable solution of the cluster dynamics 
is universality. Within clusters, concerted optimization 
of codes is possible. These mechanisms are consistent 
with two macroevolutionary scenarios: 1) the code stayed 
nearly universal at all times, 2) The codes diverged at 
first but then gradually became universal. 



A. Competition between innovation pools 

One of the advantages of communal evolution is 
that universally good traits and refinements can spread 
through HGT to organisms occupying different niches, 
preserving their diversity. In a world increasingly domi- 
nated by protein, most innovations would involve them, 
and correspondingly HGT will be most effective between 
organisms having the same genetic code. In this way, the 
organisms sort into communities sharing related genetic 
codes. A single code community can span cells adapted 
to different niches and with different organization. 

The larger the community and diversity of organisms 
sharing the same code, the larger is the pool of protein 
innovations accessible to everyone. This leads to faster 
evolution among the larger communities than the smaller 
ones, and therefore a greater potential to invade niches 
occupied by organisms with different incompatible ge- 
netic codes. With this dynamics larger communities will 
tend to become even larger at the expense of smaller 
ones. The only stable solution is a universal genetic code. 
Thus, it is not better genetic codes that give an advan- 
tage but more common ones. 

The elementary step in this process is the overtaking 
of an occupied niche by the descendants of an organism 
with a different genetic code. If two groups of organisms 
compete with each other, the one that has access to more 
innovations (the one belonging to the larger community 
of common/compatible genetic codes) will on average out 
compete the other. In contrast to the case with only ver- 
tical evolution, there is an active feedback loop, driven by 
innovation sharing through HGT, which not only singles 
out the genetic code from all other properties of a cell, 
but also provides a mechanism that drives competition 
between codes. 

This mechanism — referred to below as "competition 
between innovation pools" or CIP — assumes that the pro- 
tocols are fixed. But how did the protocols themselves 
emerge and evolve? How can a protocol be upgraded 
without destroying it? 



B. Code attraction and optimization due to HGT 
of protein coding regions 

A population of organisms occupying a niche is subject 
to spontaneous code mutations, and is bombarded by for- 
eign genetic material from organisms occupying different 
niches. Horizontally transferred genes can be useful for 
the recipient even if the donor has a (somewhat) different 
code. For example, the codon usage (e.g. synonym codon 
usage frequency) of a transferred gene is adapted to the 
donor code and therefore different from that of the recip- 
ient. Correspondingly, there will be indirect pressure for 
the recipient code to readjust itself to make a better use 
of the new gene. 

We expect that the code response would involve several 
characteristic time scales. On the long scale, the direction 
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of change is to reduce ambiguity, but on the short scale 
the code must be able to tolerate a greater level of ambi- 
guity while ingesting new genes. The means available to 
the cell to detune the level of precision of translation may 
be considered to be of two essential origins: those inter- 
nal to the cell, and those which arc communal, reflecting 
the influence of the environment and neighbouring cells. 
Mechanisms internal to the cell include change in tRNA 
expression levels and detuning of the ribosomal machin- 
ery itself, as is known to occur through variations in Mg 
ion concentration, antibiotics, and structural mutations. 
Communal mechanisms are likely to involve the import 
of tRNA from other organisms. The increase on the short 
time scale in translational ambiguity is compensated for 
on the same time scale by the beneficial effects of the 
new gene. Eventually the codon and amino acid usage of 
the newly transferred segment will equilibrate with the 
rest of the genome and the indirect pressure of the donor 
code on the recipient code will disappear, while leaving 
behind its accumulated effects. 

In somewhat more detail, these arguments indicate 
that, after a HGT event, the genetic code of the donor 
influences the genetic code of the acceptor. Given an 
alien gene, the host-alien gene system undergoes a cycli- 
cal dynamical process leading to full utilization of the 
new gene. In one part of the cycle, the host detunes 
its own code for purposes of recognizing the alien code; 
an example of such a detuning process has been docu- 
mented in streptomycin-dependent mutants l20l l2lj| and 
ribosomal ambiguity mutants |i22. .23i] in bacteria. In the 
other part of the cycle, the alien gene codons are mu- 
tated to conform to the host code. This process results 
in the detection with greater precision of the alien sig- 
nal. A snapshot of this process would reveal a genome 
as a mosaic of horizontally transferred fragments from 
other genomes with different characteristic patterns of 
codon usage. However, these are only the tip of the ice- 
berg: beyond codon usage are the subtle but important 
changes in amino-acyl tRNA synthetase precision and the 
ambiguity level of the translational mechanism itself. 

The interaction between the genetic codes is attractive. 
Typically, the closer the translation of a foreign coding 
region is to that in the donor, the higher is the probabil- 
ity that it is functional. Therefore, the selective pressure 
will be to change codons of the recipient code in the di- 
rection of the donor code, even if only in a probabilistic 
fashion. The dynamical outcome of this attraction must 
be uniformization. This expectation is confirmed by the 
computer simulations presented below. 

HGT requires that the genetic codes of the host and 
the recipient are sufficiently similar, but how similar is 
sufficient depends on the nature of the proteins and the 
overall accuracy of decoding. There are strong reasons 
to believe that the more primitive the code of the donor 
is, the greater is the genetic code distance over which 
HGT is possible. This is because the tolerance of the 
proteins to errors in their primary structure is coadapted 
to the error rates of the translational machinery. A cell 



with a non-optimal code cannot afford very capricious 
and therefore highly fine-tuned proteins because of the 
cost of discarding defective proteins. A protein that is ro- 
bust to translational errors a fortiori is also more tolerant 
to translation with a different code. Conversely, the less 
optimized the recipient code is, the more error-tolerant 
its proteins are, and therefore the less harmful will be 
the effect on the established genes of a code change in 
the direction of the donor code. This has the important 
consequence that in the initial stages of the genetic code 
evolution, when the diversification tendency of codes was 
strongest, HGT was possible and must have been exten- 
sive despite the presence of many different codes. 



C. HGT of translational components 

To this point, our discussion has managed to avoid 
the specifics of how the genetic code is implemented in 
hardware, as it were. However, we cannot ignore the 
possibility that the translational components themselves 
benefited from HGT, and we now turn to this briefly. A 
fuller account will be given elsewhere. 

The genetic code is a representation of a family of 
modules, which are universal across all organisms, and 
are specified by the mechanisms of translation, such 
as tRNAs and charging enzymes (aminoacyl-tRNA syn- 
thetases in a modern day setting). The task of improving 
translation and the code is also universal, i.e. largely in- 
sensitive to the niches organisms are occupying. So is it 
possible that HGT of translational components played 
an important role in the evolution of the codes? Is 
there any significance in the functional separation be- 
tween the translational machinery (the ribosome) and 
the code speciflcators (tRNAs and charging enzymes)? 
Imagine for simplicity a situation in which organisms oc- 
cupying diverse niches have the same malleable genetic 
code and ensembles of tRNAs. The discovery of a tRNA 
modification that changes the code and increases its op- 
timality (and therefore the efficiency of translation) in 
one organism will also be beneficial for organisms in the 
other niches, due to the universal benefit of optimality. 
Therefore, a spread of the discovery is beneficial to all 
recipients, and can be assumed to occur through various 
HGT mechanisms including via active elements such as 
viruses and plasmids. 

Therefore, if the spread through HGT is rapid com- 
pared to innovations, a core of organisms having the same 
genetic code can maintain its integrity while evolving to- 
wards optimality. Notice that this mechanism does not 
rely on common ancestry and preserves the diversity of 
the organisms. Moreover, this mechanism is distinct from 
any survival of any "fittest" species. In the absence of an 
attractive force that restrains deviant codes, this core of 
organisms would become depleted, if there was any cir- 
cumstance that prevented a code update from invading 
specific populations. If the depletion is slow enough, the 
deviants will be at a communal disadvantage and disap- 
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pear as described in section \LKI The depletion mecha- 
nism will compete against an expansion of the core due 
to the benefit of a common protocol shared by a large 
population. 



D. Diversification of the translation mechanism 

The special role of the genetic code as an innovation- 
sharing protocol leads to a possible observational conse- 
quence. In a core community of organisms that is in the 
process of code optimization, the compatibility of code 
specificators is enforced. Once the optimization of the 
genetic code is complete, there is no pressure to main- 
tain compatibility. Therefore the "freezing" of the uni- 
versal genetic code could trigger the radiation of the un- 
derlying translational machineries. So, even if translation 
emerged earlier than the other basic cellular systems, but 
the optimization of the code took an extended time, the 
translational componentry would have diversified less. 
This is consistent with the observation that the transla- 
tion mechanism is more conserved evolutionary than the 
replication and transcription ones. Although we do not 
have a complete understanding of the Darwinian transi- 
tion 18], our argument suggests that code universality 
and optimality were necessary but not sufficient mecha- 
nisms for the transition to vertical evolution. 



E. Interactions between HGT mechanisms 

The different collective mechanisms enabled by HGT 
and outlined above are also capable of synergistically in- 
teracting with each other. 

We saw above that the evolutionary expansion of the 
most popular cluster of codes provides the necessary sup- 
port for the maintenance of an otherwise weakly deplet- 
ing universal core. The opposite is also true. The CIP 
mechanism is ineffective if there are no clusters of suffi- 
cient size on which it can operate. The establishment of 
such clusters is greatly facilitated by the HGT of code 
specificators and protein coding regions. Distribution of 
modules enforces modularity that in turn enforces the 
distribution of modules. Similarly, exchange of protein 
coding regions enforces universality, thus making it eas- 
ier to exchange genes. Therefore, there are positive feed- 
back loops that provide at least local stability to the pro- 
tocols, and turn them into effective degrees of freedom at 
a longer time scale. The global stability and universality 
is then guaranteed by the "winner takes all" nature of 
the CIP. 

HGT of code specificators and protein coding regions 
interact not only through the CIP mechanism but di- 
rectly as well. If an organism obtains a gene from an- 
other niche, its place in the ecosystem is such that it has 
potential contact with the genetic material of the donor. 
Therefore, the recipient has a better than random chance 
to obtain the right code specificator from the donor as 



well, before the special codon usage of a recently acquired 
trait drifts. The exchange of code specificators provides 
a channel through which codes can become more similar 
in response to the attraction of codes due to exchange of 
protein coding regions. 

In summary, it is the interaction between the different 
mechanisms outlined above that makes the emergence 
and maintenance of universality robust. At the same 
time, due to the complexity of the problem it is useful to 
study the different components in isolation as well. 



F. HGT and the observed statistical properties of 
the genetic code 

So far, we have argued that HGT and the special role of 
the genetic code as an innovation-sharing protocol allevi- 
ate the conceptual difficulties in understanding the simul- 
taneous universality and evolvability of the genetic code. 
Does this improved understanding help us explain some 
of the statistical features of the modern genetic code? 
And how can we expose the signatures of the above mech- 
anisms that are buried in the functional and structural 
design of the translational system and its phylogenetic 
variations? 

To address this, one needs to complement the above 
generic mechanisms with insight about the elementary 
evolutionary changes of the genetic code. Our goal in the 
remainder of this paper is to attempt to identify robust 
or generic statistical properties of translation that arise 
from our proposed evolutionary mechanisms, but which 
are relatively insensitive to fine details. To begin, we 
model the code attraction mechanism and ask: what is 
the effect of HGT on the optimality of the genetic code? 

We employ genetic code dynamics similar to that first 
introduced by Sella and ArdeU [HIH HI]. The main 
feature is the coevolution between the genetic code and 
codon usage at different functional sites. The code deter- 
mines the codon usage at mutation selection equilibrium. 
In turn, the codon usage determines the fitness costs or 
benefits of the accessible code changes, thus guiding the 
code's evolution. Code changes that are beneficial given 
the typical codon usage of a population can invade it. 
To account for HGT we couple the evolution of differ- 
ent codes by postulating that a fraction of each genome 
consists of pieces coming from other genomes. 

The virtue of Sella and Ardell's model is that it is a 
closed model of the evolution of the genetic code, and 
shows that the evolvability barrier is surmountable in a 
protein dominated world. Its shortcoming is that it does 
not address the fact that translation is a dynamical pro- 
cess, with competition between its various components. 
This means that the Sella and ArdeU model on its own is 
not adequate to identify generic statistical signatures of 
the evolutionary mechanisms of translation, because the 
statistical properties of the code and the structure of the 
translational system are precisely the stable resolutions 
of the design tradeoffs and evolutionary conflicts inherent 
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to translation. 

The code attraction mechanism that we use at this 
point is also insensitive to the implementation of transla- 
tion, and so has the same shortcoming. Thus, combining 
it with Sella and Ardell's model we will still not be able 
to address all evolutionary aspects of the problem. Nev- 
ertheless, such a model, while admittedly too simple for 
our ultimate goals, can, encouragingly, still explain the 
universality and optimality of the genetic code. The only 
key aspect of translation that it is necessary to incorpo- 
rate, even if by introducing it by hand, is mistranslation. 

At a next level in the hierarchy of models, one needs 
to incorporate the tRNAs as agents of both the collective 
molecular effort of translation and the communal evolu- 
tion of the genetic codes. In contrast to the above, in 
such models mistranslation would become a dynamical 
variable that emerges from the competition itself. This 
lower level of description entails a richer suite of obser- 
vational outcomes, and provides a unique and essential 
role for the organism as a resource manager and conflict 
regulator for the various dynamical processes within it. 

II. MODEL OF CODE ATTRACTION DUE TO 
HGT 

We first describe the modeling of the genetic code and 
mistranslation errors, then define the proteome struc- 
ture and the reproductive success of genomes as a func- 
tion of their codon sequences. We show how to compute 
the probability distribution of codons at different func- 
tional sites at mutation selection equilibrium. Finally we 
present a simulation algorithm which incorporates HGT. 
Genetic code:- The genetic code is a probabilistic map 
Prob{c — » a) between codons and amino acids. The map 
is probabilistic because the charging of tRNAs with par- 
ticular amino acids and the decoding of codons through 
competition of tRNAs are probabilistic molecular events. 
Quite generally, 

Probic^a)=J2TctCtc.. (1) 
t 

where is the probability that a codon c is read by 
tRNA species t, and Cta is the probability that it is 
charged with amino acid a. Assuming one-to-one map- 
ping between tRNA species and codons, equal concen- 
trations of different tRNAs, and ignoring mischarging we 
set 

Prob{c ^ a) = ^ Tcc'Saa{c'),a, (2) 
c' 

where the sum is over the codons and Tec' — 1^/9 if c 
and c' are nearest neighbors. Tec = 1 — v and Ted — 
otherwise, with v being the mistranslation rate. (9 is the 
number of neighbors for codons consisting of three letters 
and an alphabet of size 4.) aa{c) is a map between the 
codons and amino acids, which will be referred to as the 
code. 



Genome and proteome structure:- A genome is a sequence 
of codons that is translated to an amino acid sequence. 
Each genome position x belongs to a site type s{x). A site 
type s is characterized by the fitness score Wa s of the dif- 
ferent amino acids that can be present at that site. The 
matrix W , together with the frequencies {Ls\ of the dif- 
ferent site types in the genome, constitutes the structure 
of the proteome. Assuming that amino acid substitutions 
at different genome positions have independent effects on 
fitness, we construct the proteome fitness score 

= (3) 

X 

where the product is over all genome positions and a{x) 
is the amino acid at position x. 

Codon usage:- Since different positions belonging to the 
same site type are phenotypically indistinguishable in the 
model, we can describe the genome by the matrix {ugc} 
that specifies the frequency of codon c among sites of 
type s. 

Genome fitness:- Accommodating the probabilistic na- 
ture of translation, we set the fitness of a genome to 
be the average of the proteome fitness score over many 
translations, i.e. / = {A). Since translations of different 
codons are independent 

f(^{c{x)},C0dej = = ^'s(a;),c(£!;) ■ (4) 

X X 

Putting everything together and switching from codon 
sequence to codon usage representation, we end up with 

{\ Ls Use 

Equilibration of codon usage:- Given the matrix of mu- 
tational effects {Fsc{code)} , defined above, what is the 
codon usage {msc} in an asexual population of an infinite 
size and large genomes at a mutation selection equilib- 
rium? Mutational pressure is characterized by the matrix 
Mcc' specifying the probability that codon c will mutate 
into codon c' in one generation. It is assumed indepen- 
dent of the site type and genome position. Any muta- 
tional biases could be incorporated in Med- Here we fo- 
cus on equally probable single nucleotide changes. In this 
case, M is specified by a single parameter n which is the 
probability for a mutation at a given site in one genera- 
tion. Following Sella and Ardell 23| , the codon usage at 
a site of type s is given by the eigenvector corresponding 
to the largest eigenvalue of the matrix 

Q(^j = Mcc'Fsc' . (6) 

The matrix Q refiects the application of selection followed 
by mutation. 

The parameters of the model described above are: 
- the number of site types, Na - the number of amino 
acids, the Na x Ns matrix W , a vector {Ls}, specifying 
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the relative frequencies of the different site types, the 
mutation rate ^ and the mistranslation rate v. 
Model dynamics:- Now we consider an ensemble of pop- 
ulations with different codes and present the dynamics: 

1. There are N entities, each with its own genetic code 
aa{c) and codon usage Wgc- 

2. At each step an entity, the acceptor, and K random 
donor entities are chosen at random. The acceptor 
codon usage is updated according to the rule 

{^-j^^Pk)ust + ^^PkU^st — 'Us^, (7) 

k=l k=l 

where u[f is the codon usage of donor fc, and pk 
is some measure of the compatibility between the 
donor and acceptor codes expressing the probabil- 
ity of acceptance. Here, we study the case with no 
barrier to HGT oi coding regions, i.e. pk = ^- H is 
the fraction of the acceptor genome that is a mosaic 
due to HGT. 

3. We attempt to change the code of the acceptor. 
We examine in random order the possible elemen- 
tary changes of the code until we find one that is 
acceptable or exhaust all the possibilities. We ac- 
cept a candidate change if it increases or at least 
preserves the fitness , calculated using the mosaic 
codon usage {usc} and equation|S| An elementary 
code change reassigns a single codon to a different 
amino acid. 

4. We equilibrate the acceptor codon usage by find- 
ing the eigenvectors corresponding to the largest 
eigenvalues of the matrices Q^. 

5. We repeat the cycle. 

The CIP mechanism, which clearly facilitates univer- 
sality (and given enough time generically leads to uni- 
versality), is factored out from the simulations in order 
to concentrate on the code attraction mechanism. Each 
evolving entity in the ensemble can be thought of as a dif- 
ferent "species" (or ecotype). While within each species 
the evolution proceeds through invasions of code variants 
with higher fitness, the different species are stable and 
their number is fixed, thus blocking the CIP mechanism. 



III. RESULTS 

A. Genetic code coevolution towards optimality 
and universality 

We evolved ensembles of codes with and without HGT 
and measured the time evolution of the average distance 
between codes and the distribution of optimality scores. 




5000 10000 15000 

Time slep Probabilily histogram 



FIG. 1: Communal evolution towards optimality of 80 codes 
with (blue) and without (red) HGT of coding regions. There 
is no barrier to HGT between different codes. The initial 
conditions are the same for both runs. Parameters: H — 0.4, 
= 0.99, A* = 10"'', f = 0.01. (A) Time development of 
the average amino acid distance between neighboring codons, 
a proxy for code optimality; (B) Probability distribution 
histogram of code optimality for randomly-generated codes. 
Horizontal axis is the frequency with which a given code op- 
timality occurs, vertical axis is the same as in (A); (C) Inset: 
time development of the average distance between codes. 

We compare the optimality scores with the correspond- 
ing distribution for randomly generated codes. The en- 
semble of randomly generated codes is constructed by 
assigning random amino acids to the codons. Initially all 
the codes are identical, and the initial code is generated 
by randomly assigning amino acids to the codons. 

The average code distance is obtained by consider- 
ing all pairs of entities with equal weight. The code 
distance between two entities is the Hamming distance, 
which counts the numbers of codons that code for differ- 
ent amino acids. 

We define the optimality score of a code as the aver- 
age amino acid similarity distance between neighboring 
codons 

(8) 

c c' 

where Ned is 1 if codons c and c' differ by a single letter 
and zero otherwise. S is an the amino acid similarity 
matrix defined as follows: 

5a/3 =^|ITa,,-IT^,,| . (9) 
s 

Figure n presents the simulation results for the follow- 
ing parameters: N = 80, Na = Ng — 20, 64 codons, 
fi = 10-^ 1/ = 0.01, Wai3 = with {Aa} be- 

ing uniformly distributed random numbers in the interval 
(0, 1) and (j) = 0.99. The HGT parameters are H = 0.4 
and K = 1. 
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The left panel - demonstrates that HGT of coding 
regions not only brings universality but greatly enhances 
the joint ability of the codes to optimize. Comparison 
with the distribution of optimality scores for random 
codes, right panel -^3, shows that, in the presence of 
HGT, the achieved optimality is highly significant. Thus 
in a qualitative way, we have provided a dynamical mech- 
anism that would give rise to the statistical properties of 
the genetic code identified in |^ ^ . 

The inset shown that without HGT the codes di- 
versify form each other. However, when HGT is present 
the tendency to diversify is eventually reversed and the 
codes get attracted to each other, gradually achieving 
near universality. It should be stressed that the proba- 
bility for achieving universality, in the absence of the CIP 
mechanism, depends on H, and is equal to one above a 
threshold. While _ff is a constant in this set of simula- 
tions, the discussion in 18] suggests that it is in fact a 
dynamical variable that is initially large and gradually 
decreases as better translation allows the evolution of a 
protein network with more specific interactions. 

We interpret these results as supporting two key con- 
cepts that underlie the arguments in this paper. First 
and foremost is the role of communal evolution in leading 
to a universal genetic code. Vertical or Darwinian evolu- 
tion does not lead to a reduction in the distance between 
codes. This is seen from the long time behavior of the red 
curves in^\. Only the incorporation of HGT gives rise to 
code convergence, as shown by the long time behaviour of 
the blue curves in^^: they get closer together with time. 
In a sense, a Darwinian (genealogical) evolution would 
get trapped, or perhaps, frozen into metastable states. 
Second is the role of communal evolution in leading to 
an optimal code. Vertical or Darwinian evolution gets 
frozen into non-optimal states, whereas with HGT, the 
code becomes optimized to a much greater extent. This is 
seen by comparing the final values of the Darwinian evo- 
lution (red) curves and the communal evolution (blue) 
curves with the vertical axis ofn3. Communal evolution 
results in a genetic code that is much further from the 
mean of random distributions than the results of Dar- 
winian evolution. 



IV. CONCLUSIONS 

With this work, we have revisited the largely- 
overlooked problem of genetic code universality and the 



conceptual difficulties associated with it. These difficul- 
ties can all be avoided if one takes, as we do, the stance 
that evolution was essentially communal from the very 
beginning. We have argued that there are three distinct 
stages of evolution, which we might classify as: (I) Weak 
communal evolution, which gave way via development 
of an innovation-sharing protocol and the emergence of a 
universal genetic code to (II) Strong communal evolution, 
which developed exponential complexity of genes, finally 
leading via the Darwinian transition to (HI) Individual 
evolution — vertical and so. Darwinian. 

Most of our analysis explored the transition between 
regimes I and II, through detailed consideration of the 
way in which a generalized form of HGT operating on 
long evolutionary time scales brings universality via dy- 
namic competition between a wide variety of collective 
innovation-sharing protocols. In particular, we argued 
how such protocols emerge through the important coevo- 
lutionary mechanism of code attraction, and presented a 
specific model that is capable of explaining the simulta- 
neous universality and optimality of the genetic code. 

The genetic code is an expression of the translation 
process, and therefore its state and significance reflect the 
various stages in the evolutionary development of trans- 
lation and the organization of the cell. Thus, a fuller 
account of the evolution of the genetic code requires mod- 
eling physical components of the translational apparatus, 
including the dynamics of tRNAs and the amino-acyl 
tRNA synthetases. Only with this level of description 
is it possible to address issues, such as the special role 
played by the polar requirement. This latter point is, we 
believe, an essential clue to the early evolution of trans- 
lational components, when the genetic code presumably 
had a rather different function. 

Evolution of the genetic code, translation and cellular 
organization itself follows a dynamic whose mode is, if 
anything, Lamarkian. 
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VI. SUPPLEMENTARY MATERIAL 
A. Universality without HGT 

Freezing before diversification:- A possible sequence of 
events leading to a frozen code Crick and Wong (2^ 
is that translation emerged, the code optimized itself and 
froze in a single niche, and only then did some other evo- 
lutionary transition trigger the spread and diversification 
of its organisms. The code is universal by virtue of com- 
petition and genetic drift between organisms occupying 
this single niche. Or, perhaps, in a localized ecosystem, 
there was more than one frozen code, but it was a single 
lineage that diversified, conquering a world that was ei- 
ther unoccupied or inhabited by inferior organisms lack- 
ing translation. The original ecosystem was marginalized 
and the rest of the codes were stochastically lost during 
the expansion. So, the universality of the code is a result 
of a large scale "founder" eS'ect. 

The problem with this freezing before diversification 
scenario is that it does not explain what was stopping 
the expansion of organisms endowed with some form of 
translation well before the genetic code froze. The evo- 
lution of translation and the refinement of the genetic 
code was most likely a multi-stage process that took an 
extended period of time. 

The alternative to freezing before diversification is 
freezing after diversification. Even without understand- 
ing how codes evolve, it is reasonable to assume that they 
eventually freeze. Let's look at the first moment of time 
when all organisms have frozen codes. The codes diversi- 
fied along with all other organism properties, so we have 
many somewhat different codes and perhaps even some 
organisms that do not have translation at all. How do we 
get from this situation to a universal code? We now out- 
line a variety of logically allowed possibilities, explaining 
their merits and drawbacks. 

Stochastic universality:- The universal genetic code is a 
simple consequence of the fact that after an extended pe- 
riod of time, all organisms will be descendants of a single 
organism and will inherit its genetic code. This happens 
for stochastic reasons: the descendants of an organism 
can invade a neighboring niche, and by mere chance out- 
compete the organisms that were already there. Since 
the entire phenotype space is connected, the repeated 
stochastic takeover of neighboring niches will result even- 
tually in a universe inhabited by organisms with a com- 
mon ancestor. The problem here is that because of the 
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stochastic nature of the process it will be extremely slow. 
In addition the phenotype space is constantly growing 
and it is not obvious that this slow stochastic takeover 
process can ever saturate it. 

Evolutionary transition following translation:- A uni- 
versally beneficial cellular property that emerged follow- 
ing the maturation of translation caused a giant selective 
sweep overcoming the preexisting ada pta tion of different 
organisms to different environments |28| . A candidate 
for such an event is the discovery of DNA. This is an 
attractive possibility. One of the problems is that by the 
time of maturation of translation we might already have 
different cell designs with different ecological roles and 
it is not immediately clear how even such a great inno- 
vation can overcome this. In contrast, if the code was 
already universal due to the communal evolution, pro- 
teins involved in DNA processing could have been easily 



distributed. 

Selection on optimality:- Once proteins were estab- 
lished as a major determinant of the phenotype, the qual- 
ity of the genetic code became the single most important 
contributor to fitness. The more optimized genetic codes 
out-competed less optimized ones. Related is the sug- 
gestion that optimality is linked to evolvability (2^ , and 
therefore, organisms with more optimal codes are evolu- 
tionary more successful. The problem here is that it is 
not a priori clear that we can ignore the diversity and 
competition along all the other phenotypic dimensions. 
Moreover, the more optimized the codes are, the less will 
the differences between them matter. So we would expect 
after all a diversification of the genetic codes. 

We conclude that, on rather general grounds, an expla- 
nation of code universality based on vertical evolution is 
likely to be problematic. 



